Discovering Informative Subgraphs in Rdf Graphs
نویسندگان
چکیده
In most contemporary approaches to pattern discovery in graphs, either quantitative anomalies or frequency of substructure is used to measure the relevance of a pattern. In this thesis, we address the issue of discovering informative subgraphs within RDF graphs. In the context of Semantic Search, relevance of such subgraphs depends on the amount of useful information conveyed to a user. This in turn depends on the meaning (semantics) of the edges in the subgraph. We introduce heuristics that guide a discovery algorithm away from banal (both low information and low relevance) paths towards more informative and relevant ones. This guidance is based on weighting mechanisms (driven by relationships) for the edges in the RDF graph. We present an analysis of the quality of the generated subgraphs with respect to path ranking metrics. We then conclude by presenting intuitions about which of our weighting schemes and heuristics produce higher quality subgraphs.
منابع مشابه
Benchmarks for SPARQL Property Paths Bachelorarbeit
The Resource Description Framework (RDF) is a triple based representation of directed graphs with labelled edges. With the emergence of RDF graphs special databases, called RDF stores, were developed. In order to query graphs, which are stored in these RDF stores, the query language SPARQL Protocol And Query Language (SPARQL) is used. With the help of this query language it is possible to descr...
متن کاملSimilar Structures inside RDF-Graphs
RDF is the common data model to publish structured data on the Web. RDF data sets are given as subject-predicateobject triples and typically are represented as directed edgelabeled graphs. To make the information represented by such graphs comprehensible, RDF-schema (RDFS) provides concepts to define a class-structure as part of the given RDFgraph and thus supports a more abstract view on the d...
متن کاملk - RDF-Neighbourhood Anonymity: Combining Structural and Attribute-based Anonymisation for Linked Data
We provide a new way for anonymising a heterogeneous graph containing personal identifiable information. The anonymisation algorithm is called k− RDF-neighbourhood anonymity, because it changes the one hoop neighbourhood of at least k persons inside an RDF graph so that they cannot be distinguished. This enhances the privacy of persons represented in the graph. Our approach allows us to control...
متن کاملDesigning Indexing Structure for Discovering Relationships in RDF Graphs
Discovering the complex relationships between entities is one way of benefitting from the Semantic Web. This paper discusses new approaches to implementing ρ-operators into RDF querying engines which will enable discovering such relationships viable. The cornerstone of such implementation is creating an index which describes the original RDF graph. The index is created in two steps. Firstly, it...
متن کاملRDF Knowledge Graph Visualization From a Knowledge Extraction System
In this work, we present a system to visualize RDF knowledge graphs. These graphs are obtained from a knowledge extraction system designed by GEOLSemantics. This extraction is performed using natural language processing and Trigger Detection. The user can visualize subgraphs by selecting some ontology features like concepts or individuals. The system is also multilingual, with the use of the an...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005